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Abstract 

A nonlinear generalisation of Schro dinger's equation is obtained 
using information-theoretic arguments. The nonlinearities are con- 
trolled by an intrinsic length scale and involve derivatives to all orders 
thus making the equation mildly nonlocal. The nonlinear equation 
is homogeneous, separable, conserves probability, but is not invari- 
ant under spacetime symmetries. Spacetime symmetries are recovered 
when a dimensionless parameter is tuned to vanish, whereby linearity 
is simultaneously established and the length scale becomes hidden. 
It is thus suggested that if, in the search for a more basic founda- 
tion for Nature's Laws, an inference principle is given precedence over 
symmetry requirements, then the symmetries of spacetime and the 
linearity of quantum theory might both be emergent properties that 
are intrinsically linked. Supporting arguments are provided for this 
point of view and some testable phenomenological consequences high- 
lighted. The generalised Klien-Gordon and Dirac equations are also 
studied, leading to the suggestion that nonlinear quantum dynamics 
with intrinsically broken spacetime symmetries might be relevant to 
understanding the problem of neutrino mass(lessness) and oscillations: 
Among other observations, this approach hints at the existence of a 
hidden discrete family symmetry in the Standard Model of particle 
physics. 
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1 Physics Through Inference 



A conceptually appealing and mathematically elegant method for deriving 
the form of probability distributions in statistical mechanics is Jaynes' prin- 
ciple of maximum entropy pQ. The Gibbs-Shannon entropy (or information) 



is maximised under given constraints to determine the form for the prob- 
ability distribution p(x). For example, if the mean energy of the system 
E = J e(x) p(x) dx is specified, then introducing the Lagrange multiplier f3 
and maximising Iqs — fiE with respect to variations in p(x) gives the well 
known canonical probability distribution p(x) oc exp(— j3e{x)). In physics the 
quantity (Q) is associated with the names of Bolzmann and Gibbs and appli- 
cations are usually restricted to statistical mechanics. However the form (Q) 
was derived independently by Claude Shannon j2] in his search for a measure 
that could be used to quantify the information content, or uncertainty, in a 
system 

It must be emphasized that the word "entropy" used in relation to Shan- 
non's information measure is an historical accident: It does not necessarily 
refer to the concept as used in statistical mechanics but rather to the more 
general idea of uncertainty. The principle of maximum entropy, or maximum 
uncertainty, is currently used as a method of inference in various fields of 
study, far beyond its original applications jSlllj- 

If one already has some a priori information about the system in the 
form of a reference probability distribution r(x), then the relative-entropy or 
Kullback-Leibler information 



is the relevant measure 2 that has to be maximised under the given constraints 
so as to obtain explicit forms for p(x). The negative of the Kullback-Liebler 
measure is also called the cross-entropy and is actually a distance measure, 
measuring the distance between the two probability distributions p(x) and 
r(x). If there is no useful a priori information then r(x) can be taken to be 
a uniform distribution and the Kullback-Liebler measure then reduces to the 
Gibbs-Shannon entropy. 

2 The reader is alerted to differing sign-conventions and terminology in the literature. 
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Thus the maximum entropy/uncertainty principle is a method of inference 
whereby one chooses probability distributions that provide the most unbiased 
description of the state of the system, since maximising the entropy measure 
acknowledges our ignorance of a more detailed structure. Since quantum me- 
chanics also involves probabilities, it is natural to wonder if the Schrodinger 
equation, which describes the evolution of probability amplitudes, too can 
be derived using a method of inference. An attempt in this direction was 
made by Frieden but the full time-dependent Schrodinger equation was 
obtained only recently by Reginatto [7j. 

Consider the Schrodinger equation for a particle in one dimension, 

The Madelung transformation [B] ip = ^Jp e lS l h can be used to write the 
Schrodinger equation in terms of two real functions, p and S, called hydro- 
dynamical variables, 



(4) 



dt~^~ mdx\dxj ^' ^ 




where 



h 2 1 d 2 ^p 
2m ^Jp dx 1 

n 2 (2dip {d xP r 

8m \ p p 2 



Q = -~-Zf («) 



(7) 



is called the quantum potential [HUH]. For Q = 0, the equation (jlj) is just 
the Hamilton- Jacobi equation for a classical ensemble of particles described 
by a probability distribution p(x, t) and with the function S(x, t) related to 
the velocity of a particle by v — — The equation © is then just the con- 
tinuity equation, equivalent to the expression for conservation of probability 
in quantum mechanics. 
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Reginatto notes that the classical limit of equations (J4l5j) . that is when 
Q = 0, may be obtained by minimising the action 

through a variation of both p and S. The full quantum equations follow if 
the variation of the action <&a is restricted so that a particular measure of 
information, the Fisher information 3 , 

is also minimised. That is, minimising the linear combination <£>a + £If with 
respect to both p and S and choosing the Lagrange multiplier £ = ft 2 /8m 
gives precisely the equations ()4I5|) which are equivalent to the time-dependent 
Schrodinger equation. It is crucial to note that the quantum potential © 
arises solely from the variation of the information measure Ip. Thus one 
may argue that the method of inference allows one to supplement classical 
mechanics with the required fluctuations to arrive at quantum mechanics. 
The method works as well for many particles in higher dimensions jjj- 

The Fisher information measure ^2] is not commonly used in physics but 
is well-known in statistics where it is used in studies of parameter estimation 
[Hj 4 . Here is a brief description of the measure. Suppose one wishes to 
form an estimate of a parameter 9 by making some measurements, y, in the 
presence of an additive intrinsic noise x, that is y — 9 + x. If p[x) is the 
probability distribution of the noise, then the Cramer- Rao bound relates the 
variance of an unbiased estimate of 9 to the Fisher information If, 

Var(a;) > (10) 
If 

with 

*-/;(=)'*■ 

3 This differs from below because in quantum mechanics the probability depends 
on the parameter t in addition to the variable x. 

4 The first few chapters of Ref. ^31 provide a concise introduction to Fisher information 
while the rest of that book develops a very different approach from that discussed in [J] 
or this paper. 
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Minimising the Fisher information therefore implies choosing the probability 
distribution that maximises the mean-square error in the estimation of 9, 
so that the distribution is one that is as unbiased as possible. Thus the 
principle of minimum Fisher information is similar in spirit to the principle of 
maximum entropy and Reginatto's derivation of Schrodinger's equation may 
be viewed as an extension of the method of inference to quantum mechanics. 

As an aside, the approach of using the Hamilton- Jacobi equations and 
the Fisher information sheds light on some puzzling aspects of quantum me- 
chanics. Firstly, since the real equations (|4I5|) have a natural interpretation, 
it suggests that the use of complex numbers in quantum mechanics is merely 
a convenience allowing one to combine the two real and nonlinear equations 
into one linear equation. Secondly, the form of the Fisher information mea- 
sure suggests a reason [T3J as to why probability amplitudes play a funda- 
mental role in quantum mechanics: The Fisher information term Ip becomes 
a polynomial function when written in terms of p 1 ^ 2 , 



Thirdly, the success of the approach in explicitly using ensembles and infer- 
ence to derive the Schrodinger equation, suggests that the wavefunction ip 
does not represent reality but rather our limited knowledge of reality (the 
inference bit) and that it represents a virtual ensemble rather than a sin- 
gle particle (the ensemble bit). These issues related to the interpretation of 
quantum machanics are still often debated in the literature 011011111; they 
will not be discussed further in this paper. 

However the derivation by Reginatto, though enlightening, also creates 
a puzzle: Why is the inference method in statistical mechanics based on the 
information measure (0) while that used in obtaining Schrodinger's equation 
based on a different measure (GJ)? If, as discussed in |T3j, the use of the in- 
formation measure (J2J) is well-motivated in statistical mechanics, then what 
is the motivation for using the Fisher information measure in the derivation 
of the Schrodinger equation? If there are several information measures, how 
does one decide a priori on which to choose? Is there a universal measure 
of which the various measures are special cases? These questions are inves- 
tigated, to various degrees, in this paper. 

It turns out that there is a close relation between the two measures 
and (JSJ) R m eq.(J2J) one chooses the reference distribution r(x) to be 




(12) 
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the same as p(x) but with infinitesimally shifted arguments, that is r(x) = 
p(x + Ax), then to lowest order, 



So to lowest order, minimising the Fisher information is the same as maximis- 
ing the relative entropy for two probability distributions that are close to each 
other, r(x) = p{x + Ax). In other words, to lowest order, one is indeed using 
the same information measure in both statistical mechanics and quantum 
theory, the main restriction in the quantum mechanics derivation being that 
one must compare a probabilty distribution with one that is slightly shifted. 
This restriction is not too difficult to motivate, for example one might guess 
that Ax represents the intrinsic uncertainty in quantum mechanics measure- 
ments, or that it represents the resolution at which the coordinates become 
distinguishable (These possibilities will be revisited once estimates are made 
on Ax later in the paper). 

But if one pursues this line of reasoning, then it suggests that the usual 
Schrodinger equation might only be an approximation, and that a general- 
isation would result if one used the full relative entropy given on the left 
hand side of (fT^j) rather than just its lowest order approximation that is 
the Fisher information measure: This is the essence of what is explored in 
this paper, leading to a nonlinear Schrodinger equation. As mentioned ear- 
lier, the quantum potential arises only from a variation of the information 
measure, so that changing the information measure will result in a modi- 
fied quantum potential which is then the sole source of the deviation of the 
nonlinear Schrodinger equation from its linear counterpart. In particular the 
continuity equation (j3J) is unchanged so that p = ip*ip is still conserved and 
has a sensible interpretation as probability density. 

Although many nonlinear generalisations of Schrodinger's equation have 
been obtained before, a partial listing is in |15j . none has been motivated us- 
ing information-theoretic arguments. On the other hand, various information- 
theoretic arguments have in fact been used by some authors to justify the 
structure of conventional quantum mechanics within the context of certain 
axioms [TSj. Of course the axioms can be questioned and that is one avenue 
where nonlinearity might enter those approaches. Indeed in the paper by 
Summhammer there is just a hint of the posibility that the linear superposi- 
tion principle might be an approximation. 

The plan for the rest of the paper is as follows. In the next section 



Ikl{p(x),p(x + A(x)) 
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the nonlinear Schrodinger equation is derived using the full Kullback-Liebler 
relative entropy for a single particle in one space dimension and its various 
properties discussed. Difficulties appear when the derivation is extended 
to higher space dimensions in Section (3) with the result that rotational 
invariance seems to be unavoidably broken. In Section (4) estimates are 
made on the length parameter which controls both the size of nonlinearities 
and also the scale of symmetry breaking. Also discussed in Section (4) are 
the links between causality and linearity and connections to other derivations 
of the Schrodinger equation in the literature. 

In Sections (5,6) the Klien-Gordon and Dirac equations are examined 
within the information theoretic framework confirming the intimate link be- 
tween nonlinearity and breaking of Lorentz symmetry. The relativistic equa- 
tions are then used to study the possibility of effective energy dependent 
masses generated by nonlinearities and applications to neutrino physics dis- 
cussed. 

The choice of information measure is re-examined in Section (7). Condi- 
tions that a physically acceptable information measure should satisfy for use 
in deriving quantum dynamics are discussed. The simplest measure satisfy- 
ing all the conditions is a regularised version of the Kullback-Liebler measure 
containing one free parameter 77 in addition to the length scale L that controls 
the symmetry breaking and size of nonlinearities. The apparently privileged 
role played by the Fisher information measure in linear quantum theory is 
discussed in Section (8). 

The main results of this paper are summarised in Section (9) and some 
ways in which the nonlinear theory might be tested experimentally are high- 
lighted. The paper ends with a discussion of various perspectives that one 
may take in deducing the laws of physics. A short appendix discusses quan- 
tum fields. 

2 Nonlinear Schrodinger equation for a single 
particle in one dimension 

The arguments of the last section invite one to use the distance measure 

$ B = -I KL (p(x),p(x + L)) = ( p(x)\n P< " X \ dx dt, (14) 

J p(X + L) 
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instead of the Fisher information, and to minimise the action $^4 + £§b by 
varying both S and p. Note that the measure (fT3|) is positive definite jH] , the 
Lagrange multiplier £ has dimensions of energy and L is a length parameter. 
The equations of motion that follow are 



dp 18/ dS\ 
dt m dx \ dx J 

9S_ + Jj_f9S\ 2 + v + q _ o 
dt 2m \ dx J 



where 



Q 



INL 



£ 



In 



p(x) 



+ 1 



p(x — L) 



(15) 
(16) 

(17) 



p(x + L) p(x) 

is the generalised quantum potential that replaces Q. The two real equa- 
tions can be combined into one complex nonlinear equation thorough the 
identification tp = ^Jp e lS / n , 



dip 



h 2 d 2 i) 

2m dx 2 



+ V(x)ip + Fi(p)^ ■ 



with 



Fi{p) = Q 



£ 



INL 



hi 



-Q 

p(x) 



p(x + L) 



+ 1 



p(x — L) 
p(x) 



h 2 1 d 2 ^p 
2m yjp dx 2 



(18) 

(19) 
(20) 



and p(x) = ip*(x)ijj(x). The energy scale £ and the length scale L are not 
independent as we need to recover the usual Schrodinger equation to lowest 



order. That is, to lowest order £& B = jf-Ip (or equivalently Qinl = Q to 



lowest order) so that one obtains through eq.([13j) the relation 

SI?-* 

Am 



(21) 



This equation has a natural interpretation. If a particle is localised to a region 
of size L then its momentum would be at of order h/2L and its energy would 
be ~ h 2 /8mL 2 which is consistent with (|2ip. Amusingly then, contained in 
the kinematics of the nonlinear Schrodinger equation is a a relation (J21)) that 
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is essentially the uncertainty principle of quantum mechanics. A critical point 
to note here is that if £ is finite, then L cannot be zero unless fi vanishes. 
Thus the length scale is intimately connected to quantum mechanics. A more 
detailed discussion of the posible physical interpretations of £ and L follows 
in Section (4). 

The nonlinearity of the generalised Schrodinger equation (|18|) is thus due 
to the nonvanishing of F\ (p) which measures the deviation of the generalised 
potential Qinl from the usual quantum potential Q. Expanding in L, one 
obtains 



Fi{p) 



n 2 L r {p'f p'f 



Am 



+ 



0{I?) , (22) 



3p 3 2p 2 

where the prime denotes derivative with respect to x. Interestingly, with 
appropriate boundary conditions, the space-time average of the new quantum 
potential Qinl, 

I P Qinl dx dt = £ f p(x) In ^ —r dx dt (23) 

J J P\X ~\- Lij 

is precisely the information measure used in the action $^ + This 
relation is the generalisation of that found by Reginatto relating the average 
of the usual quantum potential to the Fisher information measure [7]. 

Here are some of the properties of the nonlinear Schrodinger equation 

1. The nonlinear equation still admits plane wave solutions, x/j = e ikx ~ lu)t 
because Fi(p = constant) = 0. 



2. If ip is a solution of the equation, then so is for any constant A 
because Fi{\ 2 p) = F\[p). This allows solutions to be normalised, the 
importance of which is discussed in Section (7). 



3. The continuity equation (JT5|l still holds by construction so that p(x) = 
ip*ifj still has a sensible interpretation as probability. 



4. If two solutions ip^ and of the equation have negligible overlap 
then their superposition is also an approximate solution. This is be- 
cause for negligible overlap, F(p) = F(p^) or F(p) = F(p^). This 
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"weak superposition principle" ( Bialynicki-Birula et.al. [15] ) will be 
used in a later section to constrain other possible information measures. 



5. For stationary states that are normalisable, one gets by substituting 

ip = e~ lEt l h (j)(x) and averaging, 

£ = -|^<J^> + < V(x) > + <F 1 (p)>, (24) 

so that the nonlinearity induced energy shifts are caused by an excess 
of energy in the modified quantum potential over the normal quantum 
potential. 

6. In the linear theory the kinematical content, represented by the state i/j, 
is separate from the dynamics which is determined by the Hamiltonian. 
These two aspects become entwined in the nonlinear theory as the 
Hamiltonian becomes state-dependent through F(p). 

Most of these properties hold also for various other nonlinear equations 
that have been constructed in the past The potential singularities in 

(|2*U|) can be avoided by using a regularised measure to be discussed in Section 
(7.1). 



2.1 Parity preservation 

The nonlinear term Fi(p) is not invariant under the parity transformation 
x — > —x. However this is an artifact that is easy to remedy. Notice that 
in (|TH|) A(x) can be either positive or negative but in ([14]) and the sequel it 
was implicitly assumed that L was positive. Therefore, in order to obtain 
a parity- invariant equation one should use a symmetrised form of Eq. (jl4]) 
involving both postive and negative values of L. Such a symmetrised measure 
is equivalent to the use in information theory of the J-divergence (3J |3] . 
The net result is the replacement 

FM^\{K{P) + F{{P)) (25) 

where Fi is the same as F\ but F{~ uses — L instead of L. Nevertheless, it 
is more useful for latter discussions to consider a more general case whereby 
the Fi are of the same form as F\ but instead of L they involve respectively 



10 



iL 1 * 1 , where are two different length scales. Then one has parity violation 
of order (L + — L~) with parity invariance only when L + = Lr . The relation 
(121)1 must be replaced in the general case by 

£ (L+)* + (L-)^jl 

2 4m ( J 

For brevity, in latter constructions and discussions the focus will be on 
simple measures like (|14)) that violate parity maximally, as the more general 
case of parity conservation or small parity violation is easily obtained using 
a symmetrisation like ()25j) with the symbols having a meaning as highlighted 
following that equation. In order to avoid possible confusion with other forms 
of parity violation, the parity violation due to the length parameters will 
sometimes be referred to as "geometric parity violation". For the parity 
invariant case, one has instead of the estimate ()22)) the next term of order 
L 2 . 

There is no problem in extending the nonlinear Schrodinger equation 
to more than one particle in one space dimension. However complications 
appear in moving to more than one space dimension even for a single particle. 
These are discussed in the Section (3). 



2.2 Periodic zero modes and localised states 



Consider the free (V = 0) one-dimensional nonlinear Schrodinger equation 
(|18|) for a wavefunction ip = e~ lEt ^ h (p(x) with real, 



E 



h 2 d 2 



2m(j) dx 2 
p(x) 



£ 



In 



p(x) 



£ 



In 



p(x + L) 



p(x + L) 
p(x — L) 
p(x) 



+ 1 - 



p(x — L) 
p(x) 



+ 



h 2 d 2 ^p 



2rriy/p dx 2 



(27) 



(28) 



The kinetic energy term (first term in (J27)0 cancels with the quantum po- 
tential Q (last term in ()27)l ) to leave only the modified quantum potential 
Qinl as the sole determiner of the eigenvalue E. This result looks odd as 
one is used to having a differential equation for the eigenvalue E, but note 
that if one expands Qinl i n the small parameter L then the usual kinetic 
term reappears with higher derivative corrections! 

The eigenvalue equation (J28)) has an exact solution: For any <fi(x) that is 
periodic in L, (p(x + L) = 4>(x), the right-hand side vanishes and so E = 0. 
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Thus the nonlinear equation supports stationary states that have zero energy 
and are periodic with a period equal to the length L which sets the scale of 
nonlinearities. These periodic solutions exist for the exact equation (}2*H|) but 
not for the perturbatively approximated equation obtained by expanding and 
truncating Qinl to some power in L. 

Nonlinear wave equations often have localised solutions. A detailed anal- 
ysis of (jTSjl needs to be performed to see if it supports solitary waves but 
here a heuristic discussion based on the quantum potential will be sketched. 
Consider a Gaussian wavepacket ip ~ exp(— x 2 //3 2 ). The usual quantum 
potential for this state is 

which gives rise to a repulsive force, —dQ/dx = h 2 x/m(3 4 ' > 0, that causes the 
wavepacket to disperse. The nonlinear quantum potential Qinl is equivalent 
to a force 



dQiNL 2SL 
dx = 



l-exp{-(L 2 -2xL)//? 2 } (30) 



which is attractive for small x. Thus there is a possibility that F(p), which 
is an effective potential in the nonlinear Schrodinger equation, is attractive, 
potentially leading to nondisperive solutions. On dimensional grounds, it 
would appear that these solutions, if they did exist, would have an extension 
on the scale L. 

The discussion here has been for the case of maximal geometric parity 
violation. At the other extreme where parity is preserved, the zero mode 
solution still exists but the analog of (J3UJ) gives a repulsive force. 

Assuming that a solution exists to the nonlinear equation whose form is 
slighly deformed from that of a plane wave, one can use dimensional analysis 
to obtain an estimate of the modification to the dispersion relation of a free 
particle. For the case of maximal geometric parity violation one obtains from 
(ED, 

, . h 2 L l 

where L is the characteristic scale in the problem such as the deBroglie 
wavelength h/k. The symbol k is used to denote momentum (not wavevector) 
to avoid confusion with p which represents probability density. Hence the 
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lowest order correction to the standard k 2 /2m relation might be expected 
to be k 3 L/4mh. On the other hand for the case where parity is preserved, 
one obtains L 2 k A /Arrih as the correction to the standard dispersion relation. 
The length scale L will be estimated in later sections. 

3 Several particles in many dimensions 

The Schrodinger equation for N particles in d + 1 dimensions is 

h 2 



ihip 



if) (32) 



where i,j = 1,2, , dN and the metric is defined as = Sij/mu) with 

the symbol (i) defined as the smallest integer > i/d. That is, i = l,...d, 

refer to the coordinates of the first particle of mass mi, i — d + 1, 2d, 

to those of the second particle of mass vri2 and so on. The overdot refers 
to a partial time derivative and the summation convention is used unless 
otherwise stated. As in Ref.[7], it is assumed that one is using Cartesian 
coordinates as otherwise the interpretation of the term Q in the variational 
action becomes unclear within the information-theoretic context. On the 
other hand this is not an artificial restriction as the correspondence principle 
between observables such as momenta and their operator representation is 
unambiguous only in Cartesian coordinates 0. 

As usual, writing if) = ^Jp e tS ^ h decomposes the Schrodinger equation into 
two real equations, 



p + gijdiipfyS) = 0, (33) 
S + f di S dj S + V-^ 9 J^-^) = 0. (34) 



The term with explicit h dependence in the last equation is the usual quantum 
potential. Equations ()33I34|) follow from a variational principle [7j: One 
minimises the action 

$ = J p 

where 



S + ^diSd.S + V 
2 



dx Nd dt + $ F (35) 



8 J p 
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is the relevant Fisher information measure for several variables. Since the 
metric is diagonal, no mixed derivatives appear in the Fisher information 
$z?. Recall that the diagonality of the metric is a consequence of the form 
of the kinetic term in the Schrodinger equation in Cartesian coordinates 
(132)1 . Thus the structure of the measure <&p for several variables is fixed 
by the conventional kinetic term of the Schrodinger equation once one goes 
beyond one space dimension. The number of particles involved introduces 
no essential modification. 

Following the strategy used in the previous section, one replaces the mea- 
sure &f by an appropriate generalisation of the Kullback-Liebler measure 
such that it reduces to $^ at lowest order. Consider first the attempt 

/ dx Nd dt p( Xi ) In PiXl) - « ^ / dx Nd dt ^ll + 0(L 3 ) .(37) 
J p(Xi + Li) 2 J p 

The lowest order term on the right-hand-side is not of the form $p. This is 
simply because the quadratic term in the Taylor expansion of a multivariate 
function generally contains mixed derivatives while, as emphasized above, 
$f does not. Therefore it is necessary to arrange matters so that no mixed 
terms appear in the expansion of a suitably generalised Kullback-Liebler 
information measure. A little reflection shows that each coordinate degree 
of freedom must be treated separately. Consider therefore 

Nd 



$ KL = f dx Nd df g f (x) ln (3g) 
J f^l Pi{ x ) 



where p%{x) = p(xi, X2, , Xi + Li, x^d) means that the probability den- 
sity has only its z-th coordinate shifted by a length parameter Li and where 
Si are parameters with dimensions of energy. §kl is therefore a sum of 
measures, one for each independent degree of freedom. Expanding, one has 

fe-E^/^ + 0{Lf. (39) 

i P 

which is similar in form to <&p. This similarity can be made into an identity 
by relating the length and energy parameters (as in the single particle case) 

h 2 

SiLf = (no sum on i) . (40) 

4m (i) 
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Having identified the correct measure §kl, one can use it in (J35j) to 
replace Qp. The variational principle then gives a quantum Hamilton- Jacobi 
equation which is equivalent to the following nonlinear Schrodinger equation 



with 



ihip 
F(p) 

Qnl 

Q 



h 2 



i> + F(p)i> 



Qnl — Q , 

Nd 



ln^ + 1-^ 



Ti 1 



-9ij 



P+i P 
2didjp dipdjp 



P 



P 



i i La 



■ %Nd) 



(41) 
(42) 
(43) 

(44) 
(45) 



as defined after eq. (|38|) . As before, the nonlinearity is determined by F(p) 
which measures the deviation of the generalised potential Qnl from the usual 
quantum potential Q. Expanding in L iy one obtains 

h 2 Li \ (dip) 3 dip dfp 



F(P) = E 



Am 



(i) 



3p 3 



2p 2 



0{L*) 



(46) 



The d + 1 dimensional multiparticle nonlinear Schrodinger equation also 
has the properties listed for the single particle equation in the last section. 
Quite remarkably, the equation ()41|) is also separable: If = ip^ip^ repre- 
sents the composite wavefunction of two independent systems, V — V\ + V2, 
then it is easy to check that F(ip*ip) = F(V'* (1) '0 (1) ) + ^ 1 (V'* (2) '0 (2) ) so that the 
nonlinear Schrodinger equation decouples into two independent equations. 

It is important to note that separability was not demanded at the outset 
but is a consequence of requiring that &kl reproduce the Fisher measure 
$f at lowest order for a single particle in higher dimensions. That simple 
requirement fixes the structure of $kl even for the multiparticle case and 
the consequent separability of the equation. As in the one dimensional case, 
potential singularities in the equations of motion can be avoided by using a 
regularised measure to be discussed in Section (7.1). 

Also, as for the one-dimensional case, the nonlinear term in equation (j41|) 
violates parity. The general situation is obtained by the replacement 



F(p)^-(F + (p) + F-(p)) 



(47) 
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in (|4ip. where the F ± are of the same form as F but instead of Li they 
involve respectively ±Lf . Then one has parity violation of order (Lf — L~). 
Correspondingly the relation (}4T)j) gets replaced by 



In the parity symmetric case the lowest order term (|46p vanishes and the 
first surviving term is of order L\. 

4 Lorentz symmetry breaking 

Even in the parity symmetric case, the nonlinear equation (}4~Tj) is not ro- 
tationally, and hence Galilean, invariant. Therefore in the relativistic case 
the equation will not be Lorentz invariant as discussed in Section (5). This 
symmetry breaking is almost apparent from the form of Qnl where each co- 
ordinate is shifted separately, and is explicit in a perturbative expansion of 
F(p). The rotational/Lorentz invariance is broken by the necessarily nonzero 
values of length parameters (even if they are the same), giving space(time) 
a cellular structure. 

There is currently no evidence for such a structure, or violation of space- 
time symmetries, but several independent lines of enquiry [T71 [TH] have sug- 
gested that at sufficient high energies, probably close to the Planck scale, 
spacetime might not be smooth and our usual spacetime symmetries might 
be violated. 

Let us recapitulate the line of reasoning that has led us to the nonlinear 
Schrodinger equation with broken symmetries. It was driven by the desire 
to use a common measure of information for inferring the proabilistic laws 
of physics. The relevant measure in the maximum entropy principle is the 
Kullback-Liebler information. Reginatto showed that the Fisher informa- 
tion measure was relevant for deriving the usual (linear) Schrodinger equa- 
tion. However the Fisher measure is an approximate case of the Kullback- 
Liebler information and using the latter measure gave rise to the nonlinear 
Schrodinger equation. The breaking of continuous spacetime symemtries in 
the nonlinear equation then appears as an unavoidable consequence of using 
the Kullback-Liebler measure in the inference procedure. 

In previous derivations ^2] of nonlinear Schrodinger equations, Galilean 
invariance has been considered a necessary constraint, just as symmetry con- 




(48) 
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straints have traditionally been used in constructing Lagrangians in high 
energy physics. Indeed without a plausible alternative guiding principle, 
it would have been difficult to abandon symmetry constraints; see however 
[T9"] . In this paper an inference principle has been used as the the main 
guide, leading to a nonlinear generalisation of the Schrodinger's equation 
and the consequent symmetry breaking. The results lead one to propose 
that the currently observed spacetime symmetries and the apparent linearity 
of Schrodinger's equation are two related phenomena and the potential future 
violation of one phenomenon at small length scales should be accompanied 
by a violation of the other one too. Given the likelihood that our observable 
spacetime is an emergent structure, this then suggests that the linearity of 
conventional quantum mechanics will be broken at scales where there are 
deviations from the usual spacetime symmetries. 

In passing, it is noted that in low energy physics applications where a 
nonlinear quantum equation might be used for phenomenological reasons, 
the symmetry breaking might be natural in some circumstances. 

4.1 Scale of Lorentz symmetry breaking 

As noted in the discussion following (|2~Tj) , the length L is dependent on % and 
so it is tempting to identify L with the Planck Length 5 , L p = (hG/c 3 )? ~ 
10 _35 m. However let us consider first various alternatives with larger length 
scales. 

As discussed earlier, the relations (j21| l4*U|) have the form of the uncer- 
tainty principle for a particle localised to a region of size L. Therefore one 
possibility to explore is whether S might be some energy intrinsic to the par- 
ticle. If one chooses £ oc mc 2 then from ()21|) L is essentially the Compton 
wavelength A c = h/mc. However this conflicts with the hydrogen atom spec- 
trum: Consider the energy correction to ground state of the hydrogen atom. 
Since the ground state is spherically symmetric, the perturbative energy cor- 
rection < F{p) > calculated using vanishes because of the odd number 
of derivates in the L 3 term. Hence the lowest perturbative correction would 
be the next term of order L 4 giving, on dimensional grounds, 



The explicit powers of h do not match in L and L p but there could be some implicit 
dependence in L. 




(49) 
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where ao = Ti 2 /me 2 is the Bohr radius ~ 10~ 10 m. Since the ratio A c /ao = 
a = 1/137 is the fine-structure constant, therefore AE ~ mc 2 a 4 which is of 
the same order as the fine structure correction to the hydrogen atom, and so 
much too large. Thus S cannot be the rest mass energy of a particle. 

Another possibility is that L is the size of fundamental particles to which 
strict quantum mechanical considerations apply. The classical electromag- 
netic radius of an electron is R e = e 2 /mc 2 ~ 10~ 15 m. Using L = R e and 
combining PUJ) with (pfUj) gives AE ~ mc 2 a e which is somewhat smaller than 
the Lamb shift and hyperfine structure corrections. The actual size of the 
electron, if it has any well-defined extension at all, is smaller than 10~ 18 m 
and so there is a remote possibility that L might be related to the size of 
elementary particles. 

Various low-energy atomic physics experiments searching for signatures 
of Lorentz violation have determined that the energy parameters in effective 
lagrangians measuring deviation from Lorentz invariance must be tiny 
|18j . "typically" of order 10~ 27 GeV. One can now make a crude estimate for 
L by assuming AE in (I49|) to be 10~ 27 GeV and using the relation (J21j) for 
the electron. This gives L ~ 10~ 20 m which is much larger than the Planck 
length of 10 -35 m. Of course as experiments get better the estimate might 
get pushed to the Planck value after all. Other estimates for L are discussed 
in a later section. 

Bear in mind that the nonlinear Scrodinger equation of this paper sug- 
gests a simultaneous violation of both the linearity of quantum mechanics 
and a violation of spacetime symmetries. These entangled effects might com- 
plicate, in the present context, the interpretation of experiments that have 
been searching for only one violation or the other. Some experiments that are 
searching only for Lorentz violation have been mentioned above. There have 
also been experiments searching for nonlinearities in quantum mechanics (as- 
suming the usual spacetime symmetries). However most of these experiments 
have been motivated by Weinberg's particular class of equations [T3] to look 
at spin systems. The nonlinearity that has been found in this paper is related 
however to spatial variations in the wavefunction and so more experiments of 
such type would be welcome. Nevertheless if spacetime symmetries are indeed 
violated, then this should be taken into account in the effective Hamiltonains 
used for interpreting the data of the experiments in [20] • That is, it might be 
useful to reanalyse the past data in the context of possible Lorentz violation. 
Interestingly, the limits on energy parameters [2D] describing possible nonlin- 
ear deviations in quantum mechanics (assuming spacetime symmetries hold) 
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are roughly the same order of magnitude as the experiments searching only 
for Lorentz violation in E| • This coincidence probably simply reflects 
the current sensitivities of the experiments. 

When should one expect the nonlinear quantum corections to become 
large? Roughly, this should happen when the length scale L of the nonlin- 
earity is comparable to the relevant length scale of the linear theory. Taking 
for simplicity R e for L, then for the hydrogen atom the nonlinear effects 
should be large when R e ~ ao which reduces to a 2 ~ 1, that is, in the regime 
where the running fine structure constant is large. More generally one would 
expect the nonlinear corrections to be large in electrodynamics when L ~ R e 
is comparable in size to the Compton wavelength of the electron A e ~ h/mc, 
leading to a ~ 1, again in the strong coupling regime. 

In short, L might conceivably be much larger than the Planck length and 
perhaps related to the size (if any) of elementary particles. Indeed, what 
the form of the Kullback-Liebler relative entropy is possibly suggesting 
is that L is the resolution at which the spatial coordinates become distin- 
guishable — and this might be due either to the fuzziness of space at short 
distances or a small but finite extension of elementary particles, or both. 

The discussion so far has assumed that the length parameters Li are 
constant, and for simplicity one may even take them to be the same Li = 
L. However a more general viewpoint would be to consider the parameters 
to be dependent on position, L^x), and not just on the particular axis in 
configuration space that is chosen. The discussion can proceed as before if 
the Li(x) are kept inside the integrals and, for example for a single particle 
of mass m, the constraint pUj) generalised to 



This requires for consistency that h 2 /m be space dependent 6 . However for 
the Schrodinger equation to be derivable from a variational action of the form 
(JHSJ), h should be kept constant. With a position dependent mass m{x) and 
a symmetrised form for kinetic energy operator, P 2 m( x ) Pi the derivation of 
Ref. [2j goes through as discussed in [21]. Then the nonlinear generalisation 
as discussed in this paper can take place. 

Position dependent masses might be relevant for describing the effective 
dynamics at short distance scales where space is not expected to be smooth. 

6 Another possibility is to promote the Lagrange multiplier £ to a background field 




(50) 



S(x,t). 
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In Sections (5,6) it will be seen that in the relativistic case quantum non- 
linearities can sometimes mimic effective spacetime dependent masses even 
when the Li are constant and there are no explicit mass terms. Also, in the 
relativistic case, one can make time dependent if one so desires. 

4.2 Causality and Linearity 

Weinberg's general study of nonlinear quantum mechanics stimulated a large 
amount of research into the subject. Some of the early theoretical investiga- 
tions concluded that nonlinear quantum mechanics would allow unphysical 
effects such as superluminal signals (221- However those studies implicitly 
assumed for nonlinear quantum mechanics some of the usual structure of 
the linear theory. Within a larger framework a nonlinear theory might be 
consistent after all On the other hand, working within certain assump- 
tions, other studies [21] have suggested that there is indeed some connection 
between causality and the linearity of quantum mechanics. 

The nonlinear theory obtained in this paper explicitly violates spacetime 
symmetries, and it has been argued that this is indeed unavoidable within 
the information theoretic framework. Thus the conclusions of this paper 
are qualitatively consistent with previous studies. In particular, since the 
spacetime symmetries have been explicitly broken, then any possible effects 
due to such violations cannot be considered pathological unless ruled out by 
experiments. 

4.3 Connections 

Over the years there have been many approaches to understanding the usual 
linear Schrodinger equation. In Nelson's [2H] derivation, quantum mechanics 
is the result of a type of Brownian motion while in Smolin's work [2H] , Nel- 
son's basic idea is married to matrix models. As one can see from Ref . [26"] . 
the extra term that is obtained there because of the nondifferentiability of 
paths is precisely the usual quantum potential, which as noted in Ref. [7], and 
as discussed above, is equivalent to the Fisher information measure in the 
variational action. Thus a generalisation of these works to yield nonlinear 
quantum mechanics of the type considered in this paper would mean that de- 
viations from Brownian motion must be considered. Of course in the model 
of [2Zj nonlinearities of a different kind might also arise when some of the 
approximations in those derivations are relaxed. 
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Another recent derivation of the linear Schrodinger equation used the 
idea of adding fluctuations to the classical Hamilton- Jacobi equation [2H] 
and then constraining those fluctuations through an exact uncertainty re- 
lation [21]. Mathematically, the result is again the emergence of the usual 
quantum potential and thus the linear Schrodinger equation. In this ap- 
proach a generalisation to a nonlinear equation might be possible through 
the use of generalised uncertainty relations. 

Studies of nonlinear quantum mechanics in the literature typically assume 
that the usual spacetime symmetries are still valid while studies of spacetime 
structure at small scales typically assume that the usual quantum mechan- 
ics is valid. There have been a few speculations that both linear quantum 
mechanics and the large scale structure of spacetime might be emergent quan- 
tities derivable from a more fundamental theory, see for example Ref. [30]. In 
[26J, it was also suggested that both macroscopic spacetime and linear quan- 
tum mechanics might have a common origin. The equation (jU), derived in 
this paper using an information theoretic principle, displays explicitly a link 
between spacetime symmetries and the linearity of quantum mechanics and 
so might be an intermediate candidate for the abovementioned search for a 
unified theory of quantum matter and spacetime. 

A word on spin. One may also decompose the Pauli equation in terms 
of hydrodynamical variables [TO]. There appears now in addition to the 
quantum potential Q of the spinless Scrodinger equation, another quantum 
potential Q s related to spin degrees of freedom. The nonlinearisation of Q 
can proceed as before, but Q s has no obvious connection with information 
theoretic measures. The resulting nonlinear Pauli equation would therefore 
contain the same extra term F(p) as the spinless Schrodinger equation. A 
different approach would use the result of Ref. jHI] where Reginatto combines 
Q with Q s and relates the result to a generalised Fisher information. One 
might attempt to nonlinearise that equation using an information theoretic 
approach. A third approach is discussed in Section (6). 

5 Klien-Gordon equation 

The single particle Klien-Gordon equation was also derived in Ref. [7j. The 
nonlinear extension is discussed here with some change in notation from 
Ref. Hj . 

The Lagrangian density for the classical single-particle Klein-Gordon field 
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is written as 

£ = ffd^d^* + m 2 c 2 #* (51) 
which upon variation of the action gives the equation of motion 

- h 2 d^(f) + m 2 c 2 (j) = . (52) 

the metric is (— , +, +, +), xq = ct. The normalisations have been chosen so 
that when <ft = exp [— imc 2 t /K] ip/2m the nonrelativistic limit of (J52)) is the 
usual Schrodinger equation. Now substitute in <f> = ^Jpe %s ^ h into (JoTj) to get 

C = p ((d lt S)(& t S) + m 2 c 2 ) + ^(d^idPp) . (53) 

Varying the action / dx 3 dxo£ with respect to S gives the continuity equation 
d^ipd^S) = which implies the well known fact that p = is not the 
conserved quantity even though it is positive definite. Although p(x) is not 
a probability density, it will be used as a measure of likelihood so as to give 
the last term of (J53*j) the interpretation of a Fisher information density. 

In the information-theoretic approach of y[I there is a neat separation 
in the equations of the classical part from the quantum fluctuations that 
are later inserted through the principle of maximum entropy. Such a pro- 
cedure is quite clear for nonrelativistic dynamics but some ambiguities in 
interpretation and other technical difficulties appear in trying to apply the 
procedure to relativistic systems and spin degrees of freedom. The following 
additional rule is adopted to guide one in the various constructions: A gen- 
eralised nonlinear equation at a higher level (e.g. relativistic) should reduce 
to the nonlinear equation at the lower level (non-relativistic) in some limit 
(e.g. large c.) The use of p = (fxp* is an example of this as we shall see. 

Thus one replaces the Fisher measure (the space-time integral of the last 
term of (J53j) ) by the more general KullBack-Liebler information measure 
which is similar in form to the one used in the non-relativistic case, 



/ dx^dxojZet (3 lP (x)ln^\, (54) 



where as before pi(x) means that p(x) has only its Z-th spacetime coordinate 
shifted by a length parameter Li and where e\ are parameters with dimensions 
of energy times mass. The parameters $ = (—1, 1, 1, 1) are the diagonal 
elements of the metric. The form (|54j) ensures that to lowest order in L; 
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the usual Klein-Gordon equation will be recovered. Furthermore noting that 
Xq = ct and that Xq + L — > t + L Q / c then in the nonrelativistic limit (|54j) 
reduces to the Kullback-Liebler measure ()38|) and the Schrodinger equation 
is recovered as required. The parameters e\ and L\ are related by 



2 



no sum on 



(55) 



Compared to the nonrelativistic case (|21|) the mass does not appear in this 
relation because of the use of the conventional form of the Klien-Gordon 
lagrangian (|51jl: its dimensions have been absorbed into the definition of e\. 

As before, the equations of motion that follow from the generalised La- 
grangian are nonlinear with the nonlinearity due to an excess of the new 
quantum potential over the usual quantum potential: 



K'd^cj) + m 2 c 2 + F(p)(f) = 
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Tip) = Qnl — Q 

3 

Qnl = e $i 

1=0 

k 2 3 

s = -7EA 



1=0 



ln^ + 1 

P+l 



2didip 
P 



P-i 
P 

dipdip 
p2 



Expanding in L;, one obtains 
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The equations above violate parity and time-reversal invariance maxi- 
mally because of the nonlinear correction which is asymmetric in L[. The 
general situation is obtained after the replacement 



Hp)^^ + {p) + ^~{p)) 



(61) 



where the T are of the same form as T but instead of Li they involve 
respectively ±£f . Then one has parity and time- reversal violation of order 
[Lf — Lf). Correspondingly the relation (JS3j) gets replaced by 



e,((L+) 2 + {Li) 2 ) = h 2 (no sum on I) 



(62) 
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In the parity and time symmetric case the lowest order term (|6U|) vanishes 
and the first surviving term is of order Lf. 

The main result of using an inference method with the help of the Kullback- 
Liebler measure is now a nonlinear Klien-Gordon equation without Lorentz 
invariance 7 , just as rotational invariance was broken in the non-relativistic 
equation. This is easy to see in a perturbative expansion in Lj of the 
Kullback-Liebler measure. Thus space and time attain a cellular structure. 
However just as in the nonrelativistic case, plane waves with the usual dis- 
persion relation exist as solutions to the nonlinear equation in addition to 
other possible states with modified dispersion relations. 

One may interpret the nonlinear term T{p) in the equation of motion as 
an effective spacetime dependent mass generated by self-interactions. How- 
ever since F{p) vanishes when p is a constant, this possibility does not exist 
for plane waves = e . Assuming a deformation of a plane- wave is a solu- 
tion, then one can estimate the contribution of nonlinearities to an effective 
mass using dimensional analysis. Let us call this Amo, then for the case of 
maximal geometric parity violation, from (J561 !6(Jj) . 



where L is some characteristic length scale in the problem such as the Comp- 
ton wavelength, h/rriQC. In terms of masses the relation becomes, 



where M is the mass scale corresponding to the Lorentz symmetry breaking 
length scale L = h/Mc. The current [S2] relative uncertainty in the electron 
mass is ~ 10~ 7 . Using this for the left-hand-side of (|65j) implies the lower 
bound estimate M ~ 10 11 GeV corresponding to L ~ lCT 27 m . The proton 
mass has a similar relative uncertainty and if one assumes that the equa- 
tion may also be used for composite objects then a lower bound estimate of 
M ~ 10 14 GeV is obtained. This crude analysis suggests that the scale of a 
possible simultaneous parity and Lorentz symmetry breaking might be close 

7 However space and time translation symmetries are still preserved. 




(63) 



(64) 




(65) 
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to the GUT scale. In this respect, precision measurements of the masses 
of elementary particles, and their possible momentum dependence might be 
useful in determining the scale L. 

On the other hand for the case where there is no geometric parity viola- 
tion, but only Lorentz violation, one has 

Am fm \ 

(66) 



m \M 

which results in the lower bound mass scale estimates mentioned above being 
reduced by a factor of 1CT 7 . 

Instead of using the Compton wavelength for Lq, one might use the de- 
Broglie wavelength, h/k, and so obtain a modification to the dispersion re- 
lation of a relativistic particle due to nonlinearities. When there is maximal 
geometric parity violation one obtains from (J6Uj) , 

Hp) - ^ (67) 

(68) 



Mc 



as the correction to be added to the right-hand side of the usual E 2 /c 2 = 
k 2 + (mc) 2 . In the parity symmetric case the relevant correction is /c 4 /M 2 c 2 . 

The massless limit of the nonlinear equation (JHE1) is even more interesting 
as now the nonlinear corrections would imply some effective mass-like be- 
haviour; this is also true for the ultrarelativistic limit of low mass particles. 
The analysis is presented below for neutrinos. 



6 Dirac and Pauli equations 

The Fisher information measure, and the other information measures that 
contain it, are clearly related to quadratic kinetic terms as seen in Sections 
(1,2). On the other hand the Dirac equation is of first order. Therefore a 
provisional approach will be adopted to establish a nonlinear generalisation 
of the Dirac equation 8 : Following Dirac, the relevant Hamiltonian is defined 
to be the square-root of the Klien-Gordon Hamiltonian even in the nonlinear 
case. Thus 

ihip = \jk 2 + m 2 + F(p) i/j (69) 
8 A different approach will be discussed elsewhere . 
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where F(p) is an appropriate matrix generalisation of the nonlinearity F{p)- 
For example, one may take F(p) to be a diagonal matrix with elements 
J-'iPa) corresponding to the a-th spinor component. In this way each spinor 
component will independently satisfy the nonlinear Klien-Gordon equation at 
least when F(p) is time-independent. Note that p is the probability density, 
not to be confused with the momentum which has been denoted by k. Since 
for all practical applications the nonlinear terms will be small, then to a very 
good approximation, 



ihtp 



(a-k + (3m) + /,(/,) 



2(a • k + f3m) 



4>, (70) 



with a, (3 the usual Dirac matrices. 

The advantage of defining the nonlinear Dirac equation in this manner is 
that the link with nonlinear Klien-Gordon equation and also the usual Dirac 
equation is manifest. Furthermore, one may define the nonlinear Pauli equa- 
tion as the nonrelativistic limit of (fTUj) . Such a nonrelativistic Pauli equation 
might be relevant for interpreting results of low energy atomic physics ex- 
periments searching for nonlinearities or Lorentz violation [TH} 12()j. 

In the nonlinear Dirac equation one may approximately identify the positive- 
definite quantity ip^ip with the probability density as from (fTOj) one deduces 
that the usual conservation law is violated only by the small nonlinearities. 
Thus as usual there is an improvement over the Klien-Gordon equation. How- 
ever since the probability density is not a Lorentz scalar in the Dirac case, 
its use in the nonlinear term F(p) contributes additionally to the Lorentz 
violation that is already there. 

Potential applications are likely to involve coupling to background gauge 
fields. This can be done as usual through the prescription k — > k ^ — eA^ the 
nonlinear term F(p) is gauge-invariant. The gauge-invariance of the equation 
can then be used to establish conservation laws if required. 



6.1 Neutrino oscillations 

In the minimal Standard Model all particles except the neutrino obtain a 
mass through condensates. However the observation of neutrino oscillations, 
see for example [33] for a review, has suggested that neutrinos might have 
a mass much smaller than the other particles. It is usually stated that the 
origin of this mass is due to physics beyond the Weinberg-Salam model. An 
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alternative scenario for the phenomena is suggested here, involving physics 
beyond linear quantum theory. 

The neutrino's equation of motion is postulated to be of the form ([70)1 . 
Then even if there is no explicit mass term, m — 0, the nonlinearities will in 
general affect the propagation of neutrinos, thus possibly mimicing a energy- 
dependent mass. Thus it is suggested that the neutrinos are strictly massless, 
at least at the scale of the current experiments, and the apparent mass is due 
to quantum nonlinearities. These nonlinearities are naturally tiny since they 
are tied to Lorentz violation. For a crude estimate, use (JMj) and identify L 
with the deBroglie wavelength h/k of a massless neutrino propagating as a 
slightly deformed plane wave. Choose, for example, L to be the GUT scale. 
Let E = kc, then the effective energy-dependent mass of the neutrino in the 
case of maximal geometric parity violation is 

Am c 2 ~ KT 2 (E MeV)^ eV . (71) 

In (171 1) the energy of the neutrino is measured in MeV and the mass value 
is quoted in eV. For example, for E = 100 MeV, the result is 10 eV. If 
the Planck scale is used instead of the GUT scale then another suppression 
of 10~ 25 would result. Indeed the induced mass varies with the symmetry 
breaking mass scale as \/\fM, which is unusual compared to conventional 
"physics beyond the standard model" scenarios which assume quantum lin- 
earity. However for the case where there is no geometric parity violation, the 
effective mass scales as 1/M and at the GUT scale it is 

Am c 2 ~ 10" 11 (E MeV) 2 eV , (72) 

which is much smaller than (|71|) . 

Current direct measurements on neutrino masses have only placed upper 
bounds in the eV range, thus the actual mass could indeed be zero, with 
ambiguities caused by the unusual momentum dependence discussed here. 
It would be interesting to do a careful analysis of the energy dependence 
of the observed neutrino 'masses" and compare with the E 3 ^ 2 behaviour 
suggested above for the parity violating case. Note that for consistency, 
the perturbative estimate (|64|) is valid only when L <C L Q . For a 100 MeV 
neutrino, Lq ~ 10~ 15 m, so the condition is abundantly satisfied for L at the 
GUT or Planck scale. 

A mass-like term in the equation of motion of the neutrino could then 
explain, using conventional methods, the oscillations that are observed. How- 
ever a more natural and simpler explanation is that the oscillations are due 
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to the nonlinear terms coupling different neutrino species. In the minimal 
Standard Model the neutrinos of different flavours are arranged simply in a 
multiplet, if) = (ip e , ipp, ip T ) T , for example in the neutral current interaction. 
So it is natural to choose F(p) in ([70)) such that p = ift'if) rather than the 
form mentioned earlier after ([69)1 . This then couples the evolution of all the 
neutrino flavours in (|7Uj) so that even if initially only one flavour is present, 
it will seed the growth of another while it itself decreases, thus resulting in 
oscillations. 

The details of the above proposal have to be worked out [33] but one 
sees an obvious consequence of the above coupling: There is now a discrete 
(permutation) symmetry relating the neutrino flavours as they appear sym- 
metrically in p = and hence F(p). At scales where the nonlinearities are 
negligible the neutrinos would appear decoupled and the symmetry would be 
vacuous. This permutation symmetry thus seems to shed some light on the 
mystery of family replication. 

Remarkably, a search of the literature reveals that some of the qualita- 
tive but almost inevitable possibilities that have been mentioned above have 
also been arrived at by other authors, using very different motivations and 
arguments. The possibility of neutrino oscillations being due to Lorentz vio- 
lation has been studied by several authors independently, see for example the 
references in [35] ■ Kostelecky and Mewes have classified the possible Lorentz 
violating terms that can appear in an effective Lagrangian. Unlike this pa- 
per, they and others of course assume that linear quantum theory is valid 
even though Lorentz symmetry has been violated. Nevertheless a qualitative 
comparison reveals some striking similarities: In jSS] it was noted that novel 
and unconventional features are possible for neutrino oscilaltions caused by 
Lorentz violation, such as direction dependence and unusual energy depen- 
dence. These features are explicit in the equation of motion (J7DJ). 

The approach of [35] was to consider the most general effective Lagrangian 
that could describe neutrino physics in the presence of Lorentz violation. As 
a result they have many free parameters and possibilities. Here a definite 
proposal for the origin of Lorentz violations has been made, tied to quantum 
nonlinearities, and so hopefully the possibilities can be narrowed down for 
comparison with experiments. 

A possible discrete family symmetry in the Standard Model has also been 
noted by other authors through a study of the empirical neutrino mixing ma- 
trix [3B] • Ironically we have been led to a possible discrete family symmetry 
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through a study of the nonlinearity in the Lorentz symmetry violating Dirac 
equation. 

7 Uniqueness 

The Kullback-Liebler information measure satifies certain axioms that are 
considered suitable for an information measure [21 El El- Of course if one 
relaxed some of the assumptions then other information measures might be 
admittable. Here let us consider the possibility of constructing nonlinear 
quantum mechanics from some other common information measures which 
reduce, in the limit when some parameter becomes small, to the Fisher infor- 
mation measure, so that we are sure to recover the usual linear Schrodinger 
equation in that limit. 

One interesting measure is the Renyi relative entropy 0], 

R a (p(x),r(x)) = - ln[ / dx V a r x ~ a \ . (73) 

1 — a J 

As a — > 1 this reduces to the Kullback-Liebler measure while using r(x) = 
p(x + L) and taking L — > gives —aL 2 lp(j>)j c 2. Another measure is the 
Wooters measure [HI] 



W(p(x), r(x)) = 




For r(x) = p(x + L) one obtains W 2 — ► L 2 Ip(p)/4 as L —>■ 0. Thus one 
may potentially use either R a or W 2 in the variational action and obtain a 
nonlinear Schrodinger equation. Separability of the equation for multiparticle 
states will be automatic when one uses a sum of information measures, one 
for each coordinate degree of freedom as discussed in the text. However 
both the Renyi measure and the (square of) the Wooters measure create 
problems. Firstly, the measures are not integrals over a density as one would 
like for an action. So the equations of motion that follow will be highly 
nonlocal, containing integrals, and the weak superposition principle will not 
hold: States with negligible overlap will influence each other strongly. 

Another drawback with the resultant nonlocal equations is that they will 
not be homogeneous, that is, scaling a solution of the equations by a constant 
does not give another solution so that normalisation and interpretation of 
states after a measurement process would be problematic, as discussed by 
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Kibble [T^j. For the equations to be homogeneous, the measures used for the 
variational action (in the P, S variables) should scale like A 2 when ip — > 
Note that here we have examples of equations that are not homogeneous yet 
are separable, in contradistinction to some other scenarios |15j . 

If one is looking for deformations of the linear Schrodinger equation, then 
the weak superposition principle and homogeniety are desirable criteria as 
they allow for easy interpretation of solutions. Therefore according to those 
criteria the information measures above can be eliminated. On other hand if 
one were interested in more general dynamics that might potentially be rele- 
vant at short distance scales then these and many other information measures 
might still be valid. 

Thus here we are interested in an information measure which satisfies the 
following conditions: 

• [CI] The information measure should be positive definite and of the 
form G(p(x); L) = J p H(p; L) dxdt where if is a function of the prob- 
ability p(x, t) and is L some parameter. This form ensures the validity 
of the weak superposition principle in the equations of motion. 

• [C2] H should be invariant under scaling, H(Xp;L) = H(p;L). This 
allows solutions to be (re) normalised. 

• [C3] G(p; L — > 0) should give the Fisher information measure at lowest 
order. This allows the linear Schrodinger equation to be recovered as 
an approximation. As discussed in the text, separability of the multi- 
particle equation is a secondary consequence of this condition when an 
appropriate sum of measures is used to reproduce the correct form of 
the linear Schrodinger equation in the limit L — > 0. 

Are there any information measures that satisfy the conditions [C1-C3] 
other than the already discussed Kullback-Leibler distance? (It must be 
emphasized that one is interested in genuine information measures which 
have some reasonable interpretation and properties j3] rather than arbitrary 
functions which only satisfy the above listed conditions, of which there are 
an infinite number). 

Indeed there are a quite a few, the best known probably being a "q- 
deformation" of the Kullback-Liebler distance. A generalised entropy that 
has been much studied in recent years is Tsallis entropy, independently dis- 
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covered earlier by Havrda and Chavrat [58] . 



^l = ^-Jv\l-V l ' q ). (75) 

It has been used to describe nonextensive statistics and is formally a 
modification of the usual Gibbs-Shannon measure through the use of a de- 
formation parameter q, yielding the latter measure as q — > 1. The q-deformed 
Kullback-Liebler measure is 

I ( A(P,r) = - J p\n q ^ (76) 
where the deformed logarithm is defined by 

K*/ = ^-7-- (77) 

The q-deformed Kullback-Liebler measure reduces to the usual measure as 
q — > 1 while for a reference distribution r(x) = p(x + L) one gets 

4\{p,r) - -^J,(p(,)) (78) 

as L — > 0. A separable q-deformed multiparticle equation can be obtained 
by using the action 

Nil 



= ~f dx Nd dt Y: £ iP {x) In, ^\ , (79) 



with q > 0, instead of eq. (j38j) in the variational procedure. Note the deformed 
logarithm, which is actually a rational function as defined above, and the 
factor 1/q. The other symbols have the same interpretation as before. In 
particular, 

h 2 

£iL 2 = (no sum on i) . (80) 

4m (i ) 

The nonlinear Schrodinger equation that results is as in ()41|) but with a 
q-deformed quantum potential 



i Nd 

Qnl = ~y^.£j 
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(81) 
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As q — > 1 the previous results are recovered. The q-deformation procedure 
extends obviously to the relativistic case. 

Interestingly, the g-deformed information measure is also related to some 
common statistical measures of "goodness of fit". For example, for q = 2 jl], 
in discrete variables, J^[(p,r) = -NYXfe ~ fo) 2 / fo = -Nx% where f e are 
the expected frequencies, f D the observed frequencies, N the number of data 
points and Xn * s Neyman's chi-square function. 

7.1 Regularity and interpolation 

As mentioned, several measures satisfy the properties [C1-C3] listed above, 
mostly various one or two-parameter deformations of the Kullback-Liebler 
measure. These will then give rise to classes of nonlinear equations all of 
which contain a nonzero length parameter with consequent spacetime sym- 
metry breaking. Thus there is no qualitative change in the conclusions of 
this paper. Those classes of equations might be useful for some studies, for 
example to see if the linear Schrodinger equation is in some sense a "fixed 
point" in the space of deformed equations. Certainly this is suggested by the 
fact that the Fisher information is contained in each of them in some limit. 

However all the measures mentioned above, including the Kullback-Libeler 
mesaure, have one technical problem. This is the occurence of singularities 
in the equation of motion, for example when p(x) or p(x + L) vanishes in 
(J2(J|) . It is conceivable that in some circumstances such singularities might be 
physically interesting but in general they are a nuisance. Therefore one other 
condition that one may place on a physically useful information measure is: 

• [C4] The information measure should not lead to singularities in the 
equation of motion. 

The following measure 9 

— J p(x) In 

with < a < 1 satisfies the additional condition of regularity as we shall 
see below. For a = 1 this measure reduces to the Kullback-Liebler measure. 
When used in the variational action with r(x) = p(x + L) it will again lead 

9 Takcn from problem number 6 in Section (7.2.7) of Ref.0]. 



p(x) 

(1 — a)p(x) + ar(x) 



dx 



(82) 
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to a nonlinear Schrodinger equation for which the weak superpostion holds 
and for which solutions may be scaled. 

However a more useful form for the equations follow if one chooses r(x) to 
depend on the dimensionless parameter too, r(x) = p(x + aL) as is presently 
explained. Consider therefore 



1 . Nd 

Ml EE — / dx Nd dtY,£iP{x) 
^ ^ i=l 



where now 



In 



p(x) 



rj)p(x) + r]Pi'(x) 



(83) 
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and where the dimensionless parameter rj takes values < rj < 1. This 
measure gives rise to the generalised potential 
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+ r]p 
(85) 

to be used in the separable nonlinear Schrodinger equation (}4*Tj) instead of 
(}4*3*|) and where (j4*0|) holds. One can now see from the ^-regularised potential 
that there are no singularities as this would require, for example, that p and 
Pi> vanish simultaneously but in that case the numerator of the relevant term 
vanishes too. 

The nonlinear Schrodinger equation with (|85j) will interpolate between 
the fully nonlinear theory at r\ — 1 and the usual linear quantum mechanics 
at 7] — 0. Notice that at 77 = one simultaneously recovers the Fisher infor- 
mation measure, the linear theory, and the spacetime symmetries. However 
even in this limit L is nonzero because the constraint (J4U)) is still opertive: L 
becomes invisible but not zero. Unlike rj, the scale L is dimensionful and the 
limit L —>■ can be performed in some units only when the constraint (J4*U|) 
is adhered to. 

One may construct other information measures with the properties [Cl- 
C4] but Ml is probably the simplest, involving a single dimensionless free 
parameter, 7], that performs the dual role of regulating singularities and al- 
lowing interpolation between the linear and nonlinear theories. The param- 
eter^) L{ with dimension of length appear always, proportional to h. The 
procedure extends obviously to the relativistic case. 
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8 Higher derivatives, nonlinearity and nonlo- 
cality 

As noted in the examples, the length scale L appears naturally, breaking 
spacetime symmetries and causing nonlinearities.. Thus within the context 
of the maximum entropy principle where Kullback-Liebler type information 
measures are used, nonlinearity cannot be avoided except in some limit where 
L is scaled appropriately or some other interpolating parameter like rj in (|85j) 
is tuned to vanish. When the nonlinearity is eliminated so is the symmetry 
breaking. That is, it is only when one uses the Fisher measure as in Ref.[7] 
that an exactly linear quantum theory without Lorentz symmetry breaking 
is obtained. Thus one would like to understand why the Fisher information 
measure has such a privileged role in the usual quantum theory. 

As discussed earlier, the variation of the Fisher information measure gives 
the usual quantum potential in the equations of motion for the hydrody- 
namic variables p and S that are related to the Schrodinger wavefunction 
by if> — \/P e tS l n . The structure of the usual quantum potential in turn is 
completely specified by the structure of the kinetic energy operator in linear 
quantum mechanics as seen from Eqs. (j3|6p . In the coordinate representation 
the kinetic energy operator p 2 involves second order derivatives as is common 
to equations of motion in physics. Therefore from this perspective one sees 
that the special role played by the Fisher information measure is tied to the 
use of second order derivatives, and no more, for the kinetic terms in the 
linear Schrodinger's equation. 

Deviations from the above norm can be expected and understood heuris- 
tically as follows. If for some reason the spacetime manifold is not smooth 
at short distances, L, then one would expect higher order derivatives in the 
basic equations, including those of quantum mechanics. Let us assume first 
that the short-distance Schrodinger equation is linear in the wavefunction, 
if), then in terms of the hydrodynamic variables p, S this would result in a 
quantum potential different from the usual one. In terms of the information 
theoretic approach this translates into using a different information measure 
than the Fisher measure. But any information measure other than the Fisher 
measure implies symmetry breaking and nonlinearities, as we have seen. 

Another heuristic argument for symmetry breaking is to note that if in the 
equation p 2 is still the kinetic energy operator then higher order derivatives 
would mean a dramatic change in the generator for translations, p 7^ —ihdi. 
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Thus in the qualitative scenario sketched here one expects the Fisher 
information measure to be only an approximate quantifier that should be re- 
placed by a fuller measure which takes into account higher order derivatives 
and symmetry breaking, as would be expected at short distances. Nonlinear- 
ities in the Schrodinger equation are then a consequence of using the fuller 
measure. 

It is instructive to view the problem from another perspective, that of 
the nonlinear equations that have been found in previous sections. Here one 
has the conventional kinetic energy term but derivatives to all higher orders, 
with nonlinearities, in the quantum potential. Suppose one could map these 
equations to a physically equivalent set described by a linear Schrodinger 
equation. Then it must be, from the link between the kinetic energy oper- 
ator and the quantum potential, that this linear equation contains a highly 
nonlocal kinetic energy operator that might be approximated by higher order 
derivatives. Thus it is heuristically conceivable that the nonlinear equations 
with symmetry breaking that we have found are related to linear equations 
that are highly nonlocal. For practical applications where one is looking 
for small deviations and comparing experiments with theory, the nonlinear 
representation is clearly preferable. 

The above discussion is not totally hypothetical. A concrete example of 
how a physical system can look very different through a change in variables 
has already been presented: It is the usual Schrodinger equation. In terms 
of ip it is a beautifully linear equation. But in terms of the hydrodynamic 
variables p, S it is a coupled set of nonlinear differential equations with some 
nonpolynomial terms like 

9 Summary of main results 

Using the principle of maximum entropy and thus pushing the arguments of 
Ref.JTj to the extreme, a nonlinear quantum mechanics has been obtained 
which simultaneously breaks Lorentz symmetry. Both the nonlinearity and 
the symmetry breaking are determined by a nonzero length scale L that 
appears naturally in the definition of the Kullback-Liebler relative entropy 
(dnj). The length scale depends on Planck's constant so it is natural to 
associate L with the Planck length, but it might possibly be a new and 
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larger scale related to the size of elementary particles 10 ; some crude estimates 
suggest that the length might be close to the GUT scale. 

It is intriguing that a length scale has emerged without any considerations 
of gravity or string theory which is where the Planck length traditionally 
appears jHHl- Furthermore, although parity and time- reversal symmetries 
can be preserved even for nonzero length scales within the present formalism, 
their breaking due to the length scale is just as easily accomodated. 

The phenomena of nonzero length scale, breaking of spacetime symme- 
tries, and nonlinear quantum mechanics are generic outcomes of any relevant 
information theoretic measure that is used in the inference procedure. Only 
specially tuned values of some parameters in some measures can give rise to a 
linear theory and to no symmetry breaking. Even then, as seen in the example 
of Section (7.1), the length scale does not vanish but rather becomes hidden 
because of the relation pUj). This suggests that the nonlinear equation ob- 
tained here is generic and might be relevant for intermediate investigations 
into the possible simultaneous emergent nature of quantum mechanics and 
spacetime symmetries. 

The nonlinear theory makes some falsifiable predictions. Firstly, since 
it suggests that the linearity of quantum mechanics and the symmetries of 
spacetime are intimately linked, the violation of one is expected to lead to 
the simultaneous violation of the other. Thus, for example, if and when 
low energy experiments searching for nonlinearity in quantum mechanics do 
succeed, then experiments searching for Lorentz violation should succeed too 
in the same energy range. Of course ideally those experiments should be 
open to both possibilities occuring simultaneously. 

Many more experiments and phenomenological studies have focussed on 
possible Lorentz and CPT violation |T7l [TH] . One much discussed possibility 
is that certain astrophysical events might be explainable in terms of modified 
dispersion relations due to the violation or modification of Lorentz invariance 
|^Uj . However the nonlinear Schrodinger equation obtained here still allows 
the conventional plane waves and also likely contains deformed waves. Thus 
a second prediction is: At high energies states with the usual dispersion re- 
lations should exist and, if the nonlinear equation supports such waves, also 
those with modified dispersion relations. That is, Lorentz symmetry break- 
ing of the equations of motion does not necessarily imply that all quantum 

10 As discussed in Section (4.1), the Li can be chosen to be spacetime dependent in 
general, Li(x, t), suggestive of new dynamical degrees of freedom at short scales. 
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states must have modified dispersion relations. On the other hand, a math- 
ematically pure plane wave might not exist physically so deformed relations 
might be more reasonably expected in reality 

It has been conjectured that the phenomena of neutrino mass and oscil- 
lations is due to quantum nonlinearities that are tied to Lorentz violation. 
Some unusual energy dependence has been noted in this approach and that 
might be one way of distinguishing the suggestion of this paper from other 
proposals. 

There are some issues that deserve further investigation. The nonlinear 
equation of this paper contains a periodic zero-mode solution which clearly 
reflects the underlying cellular structure of space. The experimental impli- 
cations of this are as yet unclear though it might speculatively be relevant in 
the context of astrophysics and cosmology. More generally, quantum nonlin- 
earities in the equations of motion of particles will obviously impact on the 
problems of dark matter /energy in cosmology. 

The nonlinear equation (j41j) suggests a natural geometric form of parity 
and time-reversal violation related to the presence of the length scale L. It 
would be interesting to study the relation of such dicrete symmetry breakings 
to the conventional discussions. Thus there is the possibility that nonlinear 
quantum mechanics, CPT breaking and Lorentz violation are all linked. 

Finally the effect of the nonlinear terms on the spreading of wavepackets, 
decoherence of superpositions, and solution of the nonlinear equation with 
various potentials should be studied [33]. 

10 Perspectives 

The construction of physical theories is traditionally driven by various con- 
straints such as: Fundamental lagrangians should not contain third or higher 
order derivatives; symmetry; renormalizability; gauge invariance and others. 
Usually there are also various implicit assumptions such as the continuity 
and differentiability of spacetime. Sometimes the constraints are imposed 
to restrict the possibilities, to simplify the mathematics or to conform with 
phenomenology. However there is now a growing willingness to explore the- 
ories where some of the conventional constraints or assumptions have been 
relaxed. This is mostly driven either by the desire to understand the origin 
of those constraints, or the need to unify quantum mechanics with gravity, 
or as a probe of potential new physics around the corner. 
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We are quite comfortable with Galilean symmetries as they appear rea- 
sonable to us, while Lorentzian symmetries are less obvious but were already 
encoded in Maxwell's equations before Einstein obtained them from more 
fundamental postulates. There have been many suggestions that these sym- 
metries of the continuum might only be approximations that do not hold 
at smaller length scales. Serious efforts to search for violations of Lorentz 
invariance are under way. On the other hand, the linearity of quantum me- 
chanics has been a long standing puzzle that has engaged many authors, 
with several suggestions that it might be an approximation of a nonlinear 
theory. Experiments that have been performed to search for departures from 
linearity have to date concluded that the deformation parameters must be 
tiny if not zero. 

If, for example, quantum effects lead to fluctuations of the geometry of 
spacetime at small scales, and perhaps even render the notion of space and 
time meaningless, then one needs a guiding principle other than the usual one 
based on symmetries that has sucessfully led us so far to the Standard Model 
of particle physics. Wheeler has forcefully expressed his "It from Bits" view 
that an information theoretic approach is the way to go in uncovering the 
basic foundation of our laws [H]. Jaynes pQ and others have shown how an 
inference method can be used successfully in statistical mechanics and many 
other fields. Frieden and Reginatto used a similar approach in obtaining the 
usual Schrodinger equation. In this paper those initiatives have been pushed 
further to see what new physics might lie below the surface. A surprising but 
pleasing conclusion has been reached, linking two apparently unconnected 
aspects of physics: The linearity of quantum mechanics and our accepted 
spacetime symmetries. 

The excellent spacetime symmetries and the superb linearity of quantum 
mechanics appear in current physics as two separate pieces of fact. It is 
proposed here that those facts are actually intrinsically linked: Both are 
approximations controlled by the same deformation parameter, a nonzero 
length scale. The main argument has relied on the assumption that a common 
information-theoretic principle be used to infer the dynamics of quantum 
theory — the information measure should be similar to that used in the 
maximum entropy principle. Other supporting and suggestive arguments 
linking the linearity of quantum mechanics with causality have been noted 
in Section (4.2). 

Of course one can construct a nonlinear Schrodinger equation which re- 
spects spacetime symmetries or a linear equation that breaks those symme- 
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tries, as many authors have done within various other contexts. One can 
also construct any number of equations that are simultaneously nonlinear 
and break spacetime symmetries. What has been done here is to use the 
framework of information theory that has been successfully used elsewhere, 
to restrict the possible forms a generalised Schrodinger equation can take, 
resulting in a link between two seemingly different aspects of conventional 
physics. 

The conclusion reached here gives prominence to information theoretic ar- 
guments and has ignored any initial symmetry constraints. Alternatively if 
one insisted on the fundamental validity of current spacetime symmetries and 
imposed them at the beginning, the results obtained here could be reinter- 
preted as : "Information theoretic arguments supplemented with spacetime 
symmetry constraints imply the linearity of quantum mechanics" . Thus the 
observed linearity of quantum mechanics would be a consequence of "hidden 
symmetries" — hidden not because they were not manifest in the equations 
but because they were not previously seen to be operational in determining 
the linearity! Whichever perspective one adopts, that of the previous para- 
graph or this, the main point is the linking of spacetime symmetries with the 
linearity of quantum mechanics. 

However the imposition of symmetry constraints at the outset within 
the information theoretic approach would allow the use of only the Fisher 
information measure. As discussed in Section (8) this can be understood as 
due to certain other assumptions that are usually made about the structure 
of quantum mechanics, with the conclusion that linearity and symmetries 
would anyway be expected to fail at short distance scales. 

Ultimately these arguments have to confront experiments, and some ten- 
tative predictions and suggestions were made in Section(9). For example, 
current searches for violations of Lorentz invariance might also be unwit- 
tingly searching for violations of the linearity of quantum theory. If data 
is limited, it is conceivably possible to attribute it wholly to violations of 
Lorentz invariance or wholly to violations of linearity of quantum mechanics, 
so a more elaborate analysis would be required to see if both violations are 
indeed occuring simultaneously. The unusual energy dependence found here 
might be one signature. 

Certainly it is not being suggested that the nonlinear equation obtained 
in this paper is the fundamental equation underlying quantum matter and 
spacetime but rather, as discussed in Section (8), it might be one approx- 
imation in a sequence leading to a final theory (if a unique theory exists!) 
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underlying space, time and matter. The equation may take different guises 
in different variables. If one insists on using the wavefunction ip then non- 
linearities and nonlocalities are to be expected at short distances. However 
at a more basic level there might be different variables in terms of which 
the equations take a more pleasing form. A final theory might involve rad- 
ically different concepts, as discussed for example in Ref. |32]j which at a 
coarser level give rise to our perception of linear quantum matter and smooth 
spacetime. 

In some sense, the information theoretic method used here to uncover 
a more basic structure to our current theories may be viewed as an inverse 
procedure to the Wilson renormalisation scheme where one obtains successive 
effective low energy theories by integrating out the unobserved high energy 
modes. In the inference method discussed here, one is "integrating back 
in" the fluctuations. Can an inference procedure also be iterated to obtain 
successively better theories? 

Appendix: Fields and strings 

Formally the inference procedure described for nonrelativistic quantum me- 
chanics can be applied to quantum field theory. One works in the Schrodinger 
representation and writes the functional Schrodinger equation jTDJ H3] for 
the wave-functional $?[(f)(x),t] which describes the state of time-independent 
fields 4>{x). The usual transformation \1/ = Re iS l h produces the continuity 
equation and a quantum Hamilton- Jacobi equation. The quantum potential, 
which is of Fisher form, can then be generalised as usual leading finally to a 
nonlinear functional Schrodinger equation. 

Apart from the usual possible ambiquities with functional equations, the 
main problems are relating the generalised results to the single particle the- 
ories considered so far and extracting some physical consequences 

It is possible that the procedure just described for fields will reduce to dif- 
ferent single particle theories from that described in the main text. However 
this is not surprising as one is not adhering to the conventional quantisation 
approach. Indeed, one might define a nonlinear quantum field theory in sev- 
eral possibly inequivalent ways. For example, the nonlinear Klien-Gordon 
equation of the text can be promoted to an operator equation (Heisenberg 
representation). Or instead the can be treated as a field and the nonlinear 
Lagrangian then used directly in a path-integral. The path-integral ansatz 
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probably retains the closest link with the nonlinear single particle equations 
considered so far. 

It might be interesting to study also the nonlinearisation of quantum 
bosonic strings [33]. The obvious question is whether the tachyonic state of 
the linear theory persists. 
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